Improved Transcription of Czech Parliament Speeches by Acoustic and Language Model Adaptation
نویسندگان
چکیده
The aim of this work is to improve the accuracy of our spoken broadcast transcription system in the task of Czech parliament speeches recognition. To achieve this goal, we propose several approaches for adaptation of both acoustic and language models of our system: a new two step unsupervised speaker adaptation strategy is presented to improve the former model while the latter one is created from a text corpus mixed properly from both general (2.6 GB of Czech newspaper texts) and domain specific data (181 MB of parliament speeches). Our experimental results show that the combination of both adaptation approaches leads to near 30% relative reduction of WER in comparison with the baseline speaker independent (SI) system operating with a general language model.
منابع مشابه
Speech-to-text technology to transcribe and disclose 100, 000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive
In this paper, we present the outcome of a 4-year project whose ultimate goal is to develop a complex platform that can transcribe, index and make searchable the historical archive of Czech and Czechoslovak Radio. The archive covers 90 years of public broadcasting and contains hundreds of thousands audio documents. The developed modular platform employs our LVCSR system that has to cope with 2 ...
متن کاملFully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon
We present a system developed for fully automated processing of Czech spoken broadcast programs. It includes modules for unsupervised segmentation of audio stream, speaker and gender recognition followed by speaker adaptation, and own speech decoder designed for extremely large vocabularies. Compared to our previous results reported in 2004, the new system reduced the WER (evaluated on the Czec...
متن کاملUsing Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives
This paper deals with unsupervised feature-based speaker adaptation techniques. The goal is to design an optimal adaptation approach for improving the recognition accuracy of a LVCSR system developed for automatic transcription of large archives of spoken Czech (e.g. the archive of the parliament talks, historical archives of Czech broadcast stations, etc.) For this purpose, several modificatio...
متن کاملDomain Adaptation of a Broadcast News Transcription System for the Portuguese Parliament
The main goal of this work is the adaptation of a broadcast news transcription system to a new domain, namely, the Portuguese Parliament plenary meetings. This paper describes the different domain adaptation steps that lowered our baseline absolute word error rate from 20.1% to 16.1%. These steps include the vocabulary selection, in order to include specific domain terms, language model adaptat...
متن کاملThe ISL 2007 English speech transcription system for european parliament speeches
The project Technology and Corpora for Speech to Speech Translation (TC-STAR) aims at making a break-through in speech-to-speech translation research, significantly reducing the gap between the performance of machines and humans at this task. Technological and scientific progress is driven by periodic, competitive evaluations within the project. In this paper we describe the ISL speech transcri...
متن کامل